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This paper is a study of weighted counting of the solutions of acychc con- 
junctive queries (ACQ). The unweighted quantifier free version of this prob- 
lem is known to be tractable (for combined complexity), but it is also known 
that introducing even a single quantified variable makes it ^^P-hard. We 
first show that weighted counting for quantifier-free ACQ is still tractable 
and that even minimalistic extensions of the problem lead to hard cases. We 
then introduce a new parameter for quantified queries that permits to isolate 
large island of tractability. We show that, up to a standard assumption from 
parameterized complexity, this parameter fully characterizes tractable sub- 
classes for counting weighted solutions of ACQ queries. Thus we completely 
determine the tractability frontier for weighted counting for ACQ. 

1 Introduction 

Evaluating conjunctive queries is a fundamental problem from database theory. It is 
equivalent to evaluating so called Select-Project- Join queries and has several equivalent 
definitions, in particular, in terms of constraint satisfaction problems. While the prob- 
lem is known to be NP-complete [CM77] . a number of structurally restricted classes of 
conjunctive queries admit efficient algorithms. Among them, the class of acyclic con- 
junctive queries, ACQ for short, is a large and useful fragment which is well-known to 
be tractable (see |Yan81] but also [GLSOlj ) . The study of acyclic queries has also been 
the starting point of applications of hypergraph decomposition tools to query answering. 



*Partially supported by DFG grants BU 1371/2-2 and BU 1371/3-1. 



1 



Much less is known about counting solutions to database queries which is a basic 
operation of standard database systems, too, and thus also a very natural fundamental 
problem. As a generalization of #3SAT, it is easy to see that counting solutions to 
unquantified conjunctive queries is ^^P-complete. Mostly it has been considered as CSP 
with fixed constraint languages without restrictions on the structure of the formulas (see 
e.g. }DK10l IBDG+Tol IDGJOQj ). Recently, Pichler and Skritek [PgIT] showed that the 
restriction to quantifier free acyclic queries gives tractable instances for counting like it 
does for decision. This result also extends to other classes for which decision is known to 
be tractable like bounded hypertree width. Also, Mengel |Menll| showed that weighted 
counting on bounded treewidth, hence bounded arity, queries is easy, too. So in a sense 
if we do not allow quantification, not much changes if we go from decision to counting 
problems. 

This picture changes completely if we allow existential quantification of variables. 
While the complexity of the decision problem remains unchanged, it turns out that 
counting solutions of general conjunctive queries becomes much harder since it is com- 



plete for ^ ■ NP BCC"'"05 . Also the acyclic case is no longer tractable. Pichler and 
Skritek |PSllj showed that introducing one single existential quantifier allows construct- 
ing #P-hard instances of very restricted form. It follows easily that counting solutions 
to unions of a polynomial number of acyclic conjunctive queries is #P-hard, too, while 
the decision problem is tractable. This shows that counting and decision differ funda- 
mentally for acyclic conjunctive queries and that in order to find islands of tractability 
for counting we need new concepts that are tailored specifically to counting problems. 

The main contribution of this paper is to give a complete picture of the (combined) 
complexity of weighted counting problems related to acyclic conjunctive queries. We 
consider weighted counting problems associated to instances of quantifier free ACQ and 
present algorithms that compute arithmetic circuits which, in turn, can be evaluated to 
obtain the result of counting problems. Thus we separate structural manipulations of the 
CQ instance from the counting process itself. This separation is implicit in other papers 
(see e.g. |FMR08j ) and is made explicit here. In this context, we first prove (Theorem [3l 
generalizing jPSllj ) that computing the sum of the weights of solution^ of a (weighted) 
quantifier free ACQ can be done in polynomial time. Then, considering extensions of 
the problem above, we show that counting the number of solutions of a conjunction or 
a disjunction of two quantifier free ACQ is #P-complete even for Boolean domain and 
for fixed arity (see Proposition [5]). This last result and the fact that one quantification 
is enough to define hard cases (see [PSllj l shows that even "minimalistic" extensions of 
quantifier free ACQ lead to intractable counting problems and that one cannot hope to 
get any meaningful tractable class by these means. 

However, our second set of results counterbalances this impression. We show that a 
large subclass of ACQ with existentially quantified variables leads to efficiently solvable 
counting problems. To this aim, we introduce a (hypergraph) parameter, called quanti- 
fied star size, to measure the degree of dispersion of free variables in acyclic conjunctive 



^Provided the evaluation space itself admits efficient algorithms for multiplication and addition of 
weights (such as the field of rationals Q) 
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queries (this generalizes the notion of connex-acychcity of |BDG07j ). We show (see 
Theorem I16p that if this parameter is bounded by some constant A; € N, the resulting 
(weighted) counting problem is solvable in time vP'^^^ where n is the size of the instance. 
Furthermore, we show (see Theorem [T8]l that the quantified star size of a formula is 
efficiently computable making the counting result applicable. We also show that the 
runtime bound vP^^") can probably not be improved substantially in the sense that the 
parameterized problem is not fixed parameter tractable under the standard hypothesis 
FPT 7^ ^W[l] from parameterized complexity |FG041 IFG06) . Under the same hypoth- 
esis, we finally show (Theorem [25]) that quantified star size is optimal in the sense that 
any other structural restriction on acyclic conjunctive queries that leads to polynomial 
time counting of the solutions must imply bounded quantified star size. 

Our study of counting problem is mostly formulated in the slightly more general 
setting of arithmetic circuit complexity. So a consequence of our results is that they also 
provide information about the structural complexity of families of polynomials computed 
by arithmetic circuits and in particular on tractability and intractability in this context. 
As an example, it is an immediate corollary of our results that the polynomial families 
we consider which are built over bounded quantified star size queries characterize the 
class VP that formalizes efficient computation in the Valiant model |Val79] and which 
is well known for its apparent lack of natural non-circuit characterizations. Analog 
completeness results can also be derived from our hardness results too. 

Structure of the paper Necessary preliminaries about arithmetic circuits, conjunctive 
queries and acyclicity are given in Section [21 In Section [3] we show that the weighted 
counting of solutions to acyclic conjunctive queries is easy extending the result of [PSll] 
but also some results in [Menllj . In Section S] we show that unions of solutions of 
acyclic conjunctive queries are hard to count. Afterwards we turn to quantified star 
size to give a parameterization of counting solutions to quantified acyclic queries in 
Section [5l In passing, we prove that determining the quantified star size of a query (i.e. 
of the hypergraph associated to the query) can be done in polynomial time. Here we also 
show hardness in the sense of parameterized complexity and show that quantified star 
size likely is the most general restriction that leads to polynomial time counting. In the 
final Section [H] we apply our results to arithmetic circuit complexity: We extend Pichler 
and Skritek's hardness for quantified conjunctive query to the Valiant model and give a 
characterization of VP by acyclic conjunctive queries of bounded star size. 

2 Preliminaries 

Arithmetic circuit complexity An arithmetic circuit over a field F is a labeled directed 
acyclic graph (DAG) consisting of vertices or gates with indegree or fanin or 2. The 
gates with fanin are called input gates and are labeled with constants from F or 
variables Xi , X2 , ■ ■ ■ , Xn ■ The gates with fanin 2 are called computation gates and are 
labeled with x or +. The polynomial computed by an arithmetic circuit is defined in 
the obvious way: An input gate computes the value of its label, a computation gate 
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computes the product or the sum of its childrens' values, respectively. We assume that 
a circuit has only one sink which we call output gate. We say that the polynomial 
computed by the circuit is the polynomial computed by the output gate. The size of 
an arithmetic circuit is the number of gates. The depth of a circuit is the length of the 
longest path from an input gate to the output gate in the circuit. A circuit is called 
multiplicatively disjoint if, for each x-gate, its two input subcircuits are disjoint. 

Conjunctive query: decision, counting and weighted counting. We assume the reader 
to be familiar with the basics of (first order) logic (see [Lib04] ). If (/> is a first order 
formula, var((/)) denotes the set of its variables, by hee{(j)) C var((j)) the set of its free 
variables and atom((f)) the set of its atomic formulas. Let x = xi, we denote i;^(x) 

the formula with free variables x. The Boolean query problem $ = (5, (j)) associated to 
a formula (/>(x) and a structure S, asks whether the set 

0(5) = {a:(5,a)h<A(x)} 

called the query result is empty or not. The (general) query problem consists of com- 
puting the set (piS), while the corresponding counting problem is computing the size 
of (p{S), denoted by |i;A(5)|. When (p is a {3,A}-first order formula the boolean query 
problem is known as the Conjunctive Query Problem, CQ for short. It is well known 
that the Boolean CQ problem is equivalent to the propositional satisfiability problem 
and thus is NP-complete. We denote by #CQ the associated counting problem: given 
a query instance $ = (5, cp), return the value of |(/'(<S)|. 

Let now F be a field and 5 be a finite structure of domain D. A F-weight function for 
5 is a mapping w : D ¥. If a is a tuple of elements of D of length k, the weight of a 
is 

k 

w{a) = Y\_w{ai). 

1=1 

The weighted counting problem for CQ, denoted #fCQ, is the following problem: 
given $ = (iS, (p) and a F-weighted function w, return the sum of the weights of all 
solutions i.e. the value of 

ae(^(5) 

When w is the constant function 1, this value is clearly equal to |0(5)|. 

Query size and Model of computation. The size ||$|| of a query input is the sum of the 
size of the formula \(p\ and of the size |5| of S. We consider the size |5| of S with domain 
D to be the number of elements in D plus the number of tuples in relations of S. 

All algorithms below are expressed in terms of operations on tuples (without consider- 
ation on their size). Hence, we choose the RAM model (with addition as basic operation) 
under uniform cost measure as underlying model of computation. However, the choice 
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of a model will have some importance only in the few cases where precise polynomial 
time bounds are given. 



Acyclic Conjunctive Queries. A (finite) hypergraph 7^ is a pair {V, E) where V is 
a finite set and E C ViV). We associate a hypergraph % = iV^E) to a formula 
(the canonical structure describing (p) by setting V := var(i;^) and E := {var(a) | a £ 
atom((?!))}. 

Definition 1. A join tree (or tree structure) of a hypergraph % = {V, E) is a pair (T, A) 
where T = (Vr, T) is a tree and X is a function from Vr to E such that: 

• for each e £ E, there is at ^Vt such that \{t) = e, 

• For each v £V, the set {t £ Vr ■ v £ A(t)} is a connected subtree of T . 

A hypergraph is acyclic if it has a join tree |Fag83| . When there is no ambiguity, 
we often identify vertices of a join tree and their labellings. We also sometimes only 
specify the tree T explicitly, without talking of A which is always implicitly understood 
to exist. A formula (j) is acyclic if its associated hypergraph is acyclic. Considering 
acyclic {3, A}-first order formulas yields the Boolean acyclic conjunctive query problem 
denoted ACQ. We denote by #ACQ (resp. #fACQ) the associated counting (resp. 
F-weighted counting) problem. If is such that free(0) = var(0)) then (p is said to be 
quantifier-free. 

Any a € (piS) will be alternatively seen as an assignment a : var(i?^) — )• or as a tuple 
of dimension |var(0)[. Two arbitrary assignments a and a' are compatible if they agree 
on their common variables. We will make use of the following classical join operators. 

Definition 2. Let (/>(x, y), 'ip{y,z) be two conjunctive queries with xflz = and let 
TZ, S be two finite structures. Let us define: 

- The natural join: (p{Tl) f>^ip{S) = {(a, b, c) : (a, b) € 0(7^) and (b,c) S ip{S)} 

- The special (left) join: 0(7^) 1X^(5) = {(a, b) : (a, b) € piJZ) and there exists (b, c) G 
i;{S)}. 

When TZ = S, (j){TZ) cxi tpiS) is simply [p A Tp]{7l). The natural join between two 
relations can be computed in time linear in the size of the relations (here in time linear 
in \4>{']Z)\ and |^(5)|). 

Remark 1 (CQ and CSP). A well-known equivalent formulation of the quantifier-free 
conjunctive query problem can be stated in terms of CSP. In this later problem, given 
two structures S and T, one asks whether there exists an homomorphism from S to T. 
In the recent past, counting and weighted counting for CSP have been mainly stated in 
the non uniform version of the problem i.e. when the template T is fixed and only S 
IS given as input (see e.g. IDBld \BDG+1(A \DG.T09f ). In this paper, we focus on the 
uniform version of the problem. 
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Polynomials defined by conjunctive queries We briefly introduce a polynomial Q that 
generalize #CQ. A more thorough discussion of Q can be found in [M en.l 1 J . 

Let $ = (0,5) be a query with domain D. We assign to $ the following polynomial 
<5($) in the variables {Xd [ d G D}. 



where /^^(a) = \{x € var((/>) | a{x) = d}\ computes number of variables mapped to d by 
a. Note that the number of variables in Q{^) is \D\, the size of the domain, and that 
Q{^) is homogeneous of degree |var(<l>)|. 

Observe that Q is essentially the weighted counting problem #fCQ by setting := 
w{d) for all d S Z). Thus if we can efficiently compute circuits that in turn compute 
Q{^), we can efficiently solve #fACQ on the instance <I>. 

Parameterized counting complexity This section is a very short introduction to pa- 
rameterized counting complexity (for more details see |FG041 IFG06j ) . 

A parameterized counting problem is a function F : S* x N — ?> N, for an alphabet S. 
Let (x, A:) G E* X N, then we call x the input of F and k the parameter. A parameterized 
counting problem F is fixed parameter tractable, or i*" € FPT, if there is an algorithm 
computing F{x, k) in time f{k) ■ \x\'^ for some computable function / : N — )• N and some 
constant c € N. 

Let F : S* X N — > N and G : 11* x N ^ N be two parameterized counting problems. A 
parameterized parsimonious reduction from F to G is an algorithm that computes for 
every instance {x,k) of F an instance {y,l) of G in time f{k) ■ \x\'^ such that / < g{k) 
and F{x, k) = G{y, I) for computable functions /, (7 : N — )■ N and a constant c E N. A 
parameterized T-reduction from F to G is an algorithm with an oracle for G that solves 
any instance (x, /c) of F in time f{k) ■ \x\'^ in such a way that for all oracle queries the 
instances {y,l) satisfy / < g{k) for computable functions f,g and a constant c G N. 

Let p-#Clique be the problem of counting /c-cliques in a graph where k is the parameter 
and the graph is the input. A parameterized problem F is in #W[1] if there is a 
parameterized parsimonious reduction from F to p-#Cliqu^. F is #W[l]-hard, if there 
is a parameterized T-reduction from p-#Clique to F. As usual, F is #W[1] -complete 
if it is in #W[1] and hard for it, too. 

A standard assumption from parameterized complexity is that not all problems in 
#W[1] (and thus in particular the complete problems) are fixed parameter tractable. 
Thus, from showing that a problem F is 7^W[l]-hard it follows that F can be assumed 
to be not fixed parameter tractable. 

Except for these definitions we will not use parameterized parsimonious reductions 
and we will not use the complete power of parameterized T-reductions either. Instead, 

■^Let us remark that Thurley |Thu06) gives good arguments for defining #W[1] not with parsimonious 
reductions. He instead defines #W[1] with parameterized T-reductions with only one oracle call. 
We keep the definition of [FG04I IFG06] . because we will show no 7^W[1] upper bounds and thus 
can avoid these subtleties. We remark though that finding the right reduction notions for counting 
problems is notoriously tricky to get right (see e.g. [KPZ99I IDHK05] !. 




d 
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all parameterized reductions in the remainder of the paper will be T-reductions with 
exactly one oracle call. 

3 Constructing circuits for acyclic conjunctive queries 

Theorem 3. Given an acyclic quantifier free conjunctive query <I>, we can in time polyno- 
mial in ||<I>|| compute a multiplicatively disjoint arithmetic circuit C that computes Q{^). 

Proof. The first step follows that of |Menll] . Since arity of queries is not bounded, a 
new approach is necessary to show that only polynomially many gates are necessary to 
compute Q{^). Also the algorithmic nature of the construction is stressed more to give 
the upper bound on the complexity of constructing the circuit C. 

So let $ = (5, (j)) be an acyclic conjunctive query. Let (T, A) the join tree associ- 
ated with (p. By definition, the tree T has m vertices ti,...,tm associated to the atoms 
A(ti), ...,X{tm) in (j). Observe that T can be constructed from cp in polynomial time; in- 
deed it can even be computed in logarithmic space, see [GLSOlj . Thus we do not consider 
the construction of T but take it as given. For t € Vr, we call <j)t the conjunction of con- 
straints corresponding to the subtree Tt with t as root. The set var((/)t) = Ut'eT var(A(t')) 
is denoted by et- For convenience we also denote the atomic formula X{t) by At. 

Let a be an assignment of some variables of (p and c C var((/)t). We show by induction 
on the depth of T that the following polynomial can be computed by an arithmetic 
circuit of polynomial size 

/t,a,c = ^ -'^a{a;) • 

ae<l>t(S) a;ec 

a~a 

Remark that if r is the root of T then, fr,0,var{(f>) = Q{^)- Observe that in contrast 
to [Menllj we have an exponential number of polynomials ft,a,c, so we cannot afford 
to compute them all in a bottom up fashion. Listead we will construct the circuit top 
down starting from r and make sure that in each step only polynomially many ft,a.,c are 
needed. This will directly give the runtime bound for the construction. 

So suppose first that t is a leaf. Then, <j)t is some atomic constraint Aj, hence (pti<S) is 
of size at most linear in \S\ < ||<&||, so the sum only involves a number of terms linear 
in ||<I>||. 

Suppose now t G Vj- is not a leaf and let ti, ...,tk in Vr be the children of t in T. Let 
Co, ci, Cfc be a partition of c into disjoint sets such that each Cj C fl c, for i = 1, ...,k 
and Co C c\ Ui=i (the need for choosing a partition is that each variable x appearing 
in several X{ti) sets must be taken into account at most once in order not to overcount 
the exponent of X^^^^-^). 

ft,a,c = ^ Y\. ^a{x) 
a&<t>t{S) 2^Gc 

= 5^ n ^"(^) 

o~a 



x£Ck xGco 
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Let At = {{\t{S) K ^tAS)) ^ <i)t2{S)) K . . . K ^t^{S). Note that At C \t{S). 
Claim 1. The set At is computable in time \S\ x 10^1. 

Proof of the claim. By induction on the tree depth. The proof is an adaptation of Yan- 
nakakis algorithm to evaluate acyclic conjunctive queries (see j YanSlj ). Note that At is 
a subrelation of (ptiS) with (pt being a constraint of the input formula. So if t is a leaf, 
the result is obvious. If t is not a leaf. Let ti, ...,tk be its children. Remark that 

At = {Xt{S) K At,) tK At;) K . . . >< ^f,, 

since each At^ is the projection of (pui'S) onto var((/)t-). The set At is computed as follows: 
Special joins are computed step by step respecting the order given by parentheses. To 
compute each join sort the two sets of assignments using the lexicographic ordering 
induced by the variables they have in common. Then, run once through the two sorted 
relations to select the right tuples. Suppose now that each At- is computable in time 
|5| • \(j)ti\- Then, At is computable in time |5| • {\Xt\ + \(j)ti | + • • • + {(pt^l) ^ I'^l ' \4't\- D 

Each solution a € can be uniquely expressed as the natural join of a tuple 

(3 & At and a sequence of Oi G cpti{S), i = 1, ...,k, compatible with /3 (more formally by 
natural join of singleton relations containing these tuples), i.e. given a G (pti^), there 
exist P E At and G cpt^iS), i = 1, k, such that 

{a} = {/S} M {ai} ixi . . . CXI {ofc}. 

Conversely, given /3 G At and a sequence of Oj G cpt^iS), i = 1, A:, compatible with 
/3, the natural join of these tuples is an a G (t>t{S). This follows from the connectedness 
condition in the join tree, i.e. from the fact that given distinct i,j < k, ya\r{(j)t-) D 
var^cptj) ^ var(At). Indeed, if ai and aj assign values to of a common variable, they must 
agree on it, because they both agree with /3. This implies that the following equalities 
hold. 

/i,a,c = n ^"(^) "'11 ^"(^) n ^"(^) 
= J2 12 ■■■ Yi n ^"iw" n ^"feW n 

= Y /ii,/3,ci • • • /tfc,/3,Cfc • Y\. 

Note that the sum is now over At and not over (j)t{S) anymore. We claim that the 
construction described above can be done in polynomial time. Indeed, for each t we 
only have to compute the ft,a,c for one fixed set c but for potentially all a G Af where 
t' is the father of vertex t in T. Thus for t we only have to compute \At'\ < \S\ < \\^\\ 
polynomials ft,a.,c- Furthermore for t we only have to access the polynomials for ti, ...,tk, 
more precisely all of fti,i3,ci, ftk,lB,c,, with /3 G At. So the computation of one /j,a,c 
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involves only 0(|At| x (/c + |co|)) = 0(|iS| x \(p\) arithmetic operations. Computing the 
ft,a,c for all a € At' but fixed t can then be done with 0{\(j)\ x operations. Summing 
up over all t we get a total upper bound of 0(|(/>P x so the circuit C for Q($) is 

of polynomial size. Now remark that each set At, for t gVt can be constructed in time 
0{\4>\ X \S\) by Claim [TJ For a fixed a, filtering all elements /3 of At compatible with a 
can be done in linear time after sorting At- Hence, the index set of each sum is efficiently 
computable and the construction of the circuit can be done in polynomial time. 

In a final step we apply the construction of Malod and Portier |MP08) to make the 
circuit multiplicatively disjoint. □ 

We get the following corollary on weighted counting problems (which generalizes a 
recent result of Pichler and Skritek |PSllj ) . 

Corollary 4. Let ¥ be field such that iterated addition and multiplication are computable 
in polynomial time in F. Then ^ipACQ can be solved in polynomial time for quantifier 
free queries. 

Proof. Given an instance <I>, Theorem [3] yields a circuit C that computes Q{^). Setting 
Xd = w{d) for all d £ D, we can evaluate Q($) to give the answer to the weighted 
counting problem. Observe that efficient evaluation is possible, because the degree of 
Q{^) is bounded and thus we can use standard depth reduction techniques to avoid a 
blowup of the size of representations of field elements. □ 



4 Union and intersection of acyclic queries 

In this section, we show that considering conjunction and disjunction of two acyclic 
conjunctive queries leads to intractable counting problems. 

Proposition 5. Computing the size of the union and the intersection of solutions to 
two quantifier free ^ACQ-instances are both -complete. This result remains true for 
#ACQ on boolean domain and arity at most 3. 

Remark 2. In JGSSOlf . it is proved that the (bi-)colored grid homomorphism problem 
is ISiF-complete. This result implies part of Proposition i.e. that counting the as- 
signments of the conjunction of two ACQ-instances is -complete (the fact that this 
hardness result is still true on Boolean domain does not follow, however). 

For the proof we use the following lemma: 

Lemma 6. Counting solutions to quantifier free conjunctive queries whose primal graph 
is a grid is ^P-complete even for domains of size 4. 

Proof. Counting solutions to general quantifier free conjunctive queries is in #P, so 
we only need to show hardness. We show hardness by reducing a restricted version of 
^circuitSAT to #CQ with the desired grid structure. From the ^P-completeness of our 
#circuitSAT version we get #P-hardness for counting solutions of conjunctive queries 
with grid structure. 
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We now define this version of T^circuitSAT tliat we call T^(A--i-grid)-circuitSAT: An 
instance of #(A--i-grid)-circuitSAT is a boolean circuit which only contains A- and -i- 
gates and in which all gates are vertices of a 2-dimensional grid. Furthermore, the edges 
of the circuit are non-intersecting paths along the edges of the grid. 

Proposition 7. #(7\--i-gridj-circuitSAT is ^P- complete under parsimonious reduc- 
tions. 

Proof. We make a parsimonious reduction from ^circuitSAT. Let C be a ^^circuitSAT 
instance, i.e. a boolean circuit. In a first step we substitute all V-gates xVy by ^(^.xA^y). 
We then make sure that every gate has at most degree 3 and that all input gates and 
the output gate have at most degree 2 by adding double negations. Call the resulting 
circuit C 

We now embed C into a grid. To do so we take a three step approach that starts 
with a coarse grid that is then refined. Let n be the size of C. We first distribute the 
vertices that represent gates into a n x n-grid Gi such that each vertex of depth i has 
the coordinates {i,j) for some j. Furthermore each edge of the circuit is a sequence of 
straight lines where each straight line goes from a vertex in one row to another vertex 
in the next row. Also in each vertex of Gi at most two lines start and end. For vertices 
on which no gate of C' lies, we assume that at most one edge starts and ends. It is clear 
that such an embedding can be constructed easily. 

In a second step we make sure that the edges of the circuit follow the edges of a grid 
without congestion. We do this for each row of the coarse grid Gi individually. We 
construct a new grid G2 by adding 2n — 1 new rows before each row in Gi and one new 
column before each column. Observe that each vertex {i,j) in Gi has the coordinates 
{2ni,2j) in G2. Each vertex v of Gi in row i has a most 2 outgoing straight lines ^1,^2 
representing edges of the circuit C' which both end in a vertex of row i + 1. Let h end 
in [i + 1, j) and I2 end in (i + 1,/) with j < j', then we call h be the low output and I2 
the high output. If there is only one output, we define it to be high. We also make the 
equivalent definition for high and low inputs. 

Now we substitute the lines representing edges of C' by paths in G2. Let / be a line 
that starts in Gi in {i,j') and ends in (i + We construct a path Pi from (2ni,2j) 
to (2n(z + 1),2/): 

• If Z is a low output and a low input the path is the piecewise linear curve through 
the vertices (2m, 2j)(2m, 2j - l)(2m + 2j, 2j - l)(2m + 2j, 2j'){2n{i + 1), 2/). 

• If Z is a high output and a low input the path is through {2ni,2j){2ni + 2j + 
1, 2i)(2m + 2j + 1, 2j'){2n{i + 1), 2/)- 

• If Z is a low output and a high input the path is through {2ni, 2j){2ni, 2j — l)(2ni + 
2j, 2j - l)(2m + 2j, 2j' + l){2n{i + 1), 2/ + l){2n{i + 1), 2/). 

• If Z is a high output and a high input the path is through {2ni,2j){2ni + 2j + 
1, 2i)(2m + 2j + 1, 2f + l){2n{i + l),2j' + l){2n{i + 1), 2/)- 
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Figure 1: The crossing paths in the left are substituted by a gadget without crossings 
in the middle that uses ©-gates which compute xor of its inputs. It is easily 
checked that the outputs compute (a © 6) © a and (a © 6) © 6 which simplify to 
b and a respectively. On the right we show how the ©-gates can be simulated 
over the basis A,V,-' without losing planarity. Degree 4 gates, splitting of 
edges and V-gates can be avoided by introducing some more -i-gates and using 
De Morgan's law. 

The result is an embedding of C into a grid such that the gates are on vertices of G2 
and the edges of C are paths in the grid. Observe that the paths were constructed in 
such a way that two paths between gates never share edges, so they only intersect in 
single vertices. 

In the final step of the reduction we get rid of these intersections on non-gate vertices 
by adding additional gates. Each crossing in G2 is substituted by the gadget illustrated 
in Figure [TJ To do so we make the grid finer again by a constant factor. The result 
is a circuit C" that is embedded into a grid. Furthermore C" has the same satisfying 
assignments as C. 

□ 

Remark 3. We could also have given a proof of Proposition^ with results on embedding 
general planar graphs into grids in the way we need it (see e.g. IVaWlf ). We have chosen 
to present an ad-hoc proof instead to keep the results of this paper self-contained. 

We now reduce #(A--i-grid)-circuitSAT to #CQ instances of grid structure. So let 
(C, G) be an instance of #(A--i-grid)-circuitSAT, i.e. a circuit G that is embedded into a 
grid G. Let G be of size nxn. W.l.o.g. we may assume that no gates are on neighboring 
vertices in G and that the output gate is not a A-gate. For each A-gate af G we arbitrarily 
fix one input as the first input while the other one is the second one. We construct a 
binary conjunctive query $ whose primal graph is G. The domain is {0, 1, 2, 3} where 
and 1 represent the usual boolean values while 2 and 3 are used in a gadget construction 
for A-gates. For each edge e = uv in G we add a constraint tpe in the variables u and v 
in the following way: 

• If e is not an edge of C, 0e has the satisfying assignments {ab | a, 6 S {0, 1, 2, 3}}. 

• If e is an edge of G directed from u to v and v is not a gate and u is not a A-gate, 
(pe has the satisfying assignments {00, 11}. 
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• If e is an edge of C directed from n to u and f is a -i-gate, 0e has the satisfying 
assignments {01,10}. 

• If e is an edge of C directed from n to v and u is a A-gate and the path to v over 
u is from the first input of v, (pe has the satisfying assignment {00,02, 11, 13}. 

• If e is an edge of C directed from n to f and f is a A-gate and the path to v over 
u is from the second input of v, (pe has the satisfying assignment {00,01, 12, 13}. 

• If e is an edge of C directed from n to f and v is not a gate and u is a A-gate, (pe 
has the satisfying assignments {00, 10, 20, 31}. 

Observe that the construction near the A-gates is possible, because no two gates are 
neighbors. So the constraints are ah well defined. Now each vertex that is not part of C 
gets a unary constraint that has only the single satisfying assignment 1. Also the output 
gate of C gets such a unary constraint. 

We claim that if we fix an assignment a to the variables representing the inputs 
of C, there is an satisfying extension to the other variables if and only if a satisfies C. 
Furthermore, this extension is unique. It is clear that the constraints along the paths and 
on the -i-gates propagate the correct values along the grid. In a satisfying assignment, 
the variable representing an A-gate has to take the value representing the values of its 
inputs in binary. The gates after the A-gates then calculate the conjunction value for 
these inputs. □ 

Proof of Proposition O Again, we only need to show hardness. By the inclusion-exclusion 
principle counting for unions and intersections is equally hard, so it suffices to show 
hardness for intersections. The reduction is straightforward with Lemma [6j Let $ be a 
conjunctive query whose primal graph is a grid. We separate the constraints into two 
new formulas: $i gets all the constraints that lie on rows of the grid, $2 gets those on 
the columns. Clearly we have $ = $1 A $2 and the $i are acyclic. Thus the first part 
of the lemma follows. 

To show that the result is true for queries on boolean domain, we sketch a differ- 
ent encoding of A--i-grid-#circuitSAT into conjunctive queries. Roughly speaking, the 
structure of the encoding is basically the same but non boolean elements are mapped 
to sequences of boolean variables (that represent their binary encodings). To do so we 
need ternary relations. For completeness, details are given below. Again let G be the 
n X n grid and suppose no gates are on neighboring vertices in G and that the output 
gate is not a A-gate. For each A-gate v, we introduce a second vertex/variable vi. We 
construct a ternary CQ instance <I> as follows. For each edge e = uv in G we add an 
constraint (pe in the following way: 

• If e is not edge of C , (pe has the satisfying assignments {00,01, 10, 11}. 

• If e is an edge of G directed from u to v and v is not a gate and u is not a A-gate, 
(pe has the satisfying assignments {00, 11}. 
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• If e is an edge of C directed from n to u and u is a -i-gate, (pe is the constraint 
with the following satisfying assignments {01, 10}. 

• If e is an edge of C directed from u to v and u is a A-gate and the path to v over 
u is from the first input oi v, (pe is the ternary constraint on variables u,v,vi with 
the following satisfying assignment set {000,010, 101, 111}. 

• If e is an edge of C directed from n to u and f is a A-gate and the path to v over 
u is from the second input of v, (j)e is the ternary constraint on variables u, v, vi 
which has the satisfying assignment {000, 001, 110, 111}. 

• If e is an edge of C directed from u to f and v is not a gate and u is a A- 
gate, (pe is the constraint on variables ui,u,v which has the satisfying assignments 
{000,010,100,111}. 

The constraint is then split into two conjunctive query instances $i and $2 as above 
grouping horizontal and vertical constraints separately. Note that, connection at gates 
V between hyperedges is now on two vertices v and vi. But the resulting hypergraphs 
for $1 and $2 are well acyclic. □ 

The reductions of this section are all parsimonious, so we directly get the following 
corollary H: 

Corollary 8. Deciding if the intersection of the solutions of two quantifier free acyclic 
conjunctive queries is nonempty is 'NF -hard. 

Note that in contrast, it is well-known that deciding the disjunction of acyclic con- 
junctive queries can be done in time linear in H^"!!. 

5 Quantified star size 

5.1 Definitions and statement of the results 

It is proved in [PS11| that introducing one single existential quantifier in acyclic con- 
junctive queries leads to #P-complete problems. So bounding the number of quantified 
variables does not yield tractable instances. In this section, we will show that not the 
number of quantified variables is crucial but how they are distributed in the associated 
hypergraph. A basic observation on the hard instance in [ PSllj is that the formula has a 
star (in the graph theoretical sense) in its associated graph whose center is the quantified 
variable. Abstracting this observation, we introduce a parameter called quantified star 
size that leads to tractable #ACQ instances. 

Before we formulate the main results of this section, we make several definitions. 

Let H = {V, E) be a hypergraph and C y. The induced subhypergraph HIS] of 7i is 
the hypergraph HIS] = (5, {e fl 5 | e G -E, e fl ^ 0}). The induced subhypergraph of 

^We state this corollary for completeness. Although we found no references, it is certainly already 
known 
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an edge set E' C E is HlE'] = (UegB' x,y ^ V , a path between x and ?/ is a 

subset of edges ei, ek & E such that x G ei, ?/ G e^, and for alH < /c — 1, H ej+i 7^ 0. 
Alternatively, a path can be seen as the sequence of vertices (x, ai, a^, y) such that 
aj G ej n ej+i. These definitions apply to graphs as well. 

Observation 9. IfTi is an acyclic hypergraph and C C V[H), then T-i[C] is acyclic. If 
T = {Vr,T) is a join tree of H then T[C], the tree obtained from T by replacing for all 
vertices t G V{T) the labeling X{t) by X{t) CiC, is a join tree ofHlC]. 

Proof. Immediate. 7~[C] is a subforest of T. The connectedness condition of the set 
{t G Vx[c'] '■ G A(t)}, for all u G C is obviously true. □ 

Definition 10 (5-component |BDG07) ). Let % = {V.,E) be a hypergraph and 5" C 1/. 
Let E^g be the set of hyperedges {e £ E : e <^ S} . The 5-component of e £ E^g is the 
hypergraph T-LlE'] where E' is the set of all edges e' G E(^g such that there is a path from 
e-S toe' -S inn[V -S\. 

It is clear that hyperedges of E^^g are partitioned into pairwise disjoint 5-components. 

Definition 11 (S-Zc-star, S'-star size). Let T-i = {y.,E) be a hypergraph, C 1/ and 
A; G N. The subhypergraph %' = iV' ,E') of T-i is a S-k-siai if: 

• %' is an S-component ofH. 

• there exist yi, ...,yk G V D S such that there is no edge e E that contains more 
than one of the yi. 

We say that yi, ■ ■ ■ ,yk form the S-k-star. 

The S-star size of 7i is the maximum k such that there is a S-k-star in T-i. 

In other words, the S'-star size of a hypergraph % is the maximal star size of its 
S'-components. 

Observation 12. Let % = {V^E) be a hypergraph, 5" C y and H' be an S-component 
ofJi. Then, ifH is acyclic, %' is acyclic. 

Proof. Let T be a join tree of %. An S'-component %' = {V',E') is a subhypergraph 
induced by the edge set E'. By definition E' is connected in G and it follows that 
{t \ X{t) € E'} induces a subtree T' of T. The connectedness condition holds in T and 
thus it holds in T', too. It follows that T' is a join tree of Ti' and Ti' is acyclic. □ 

Definition 13. The quantified star size of a acyclic conjunctive formula (/)(x) is the 
S-star size of the hypergraph Ti associated to (/>(x), where S is the set of free variables 
in 0(x). 

Example 14. The formula (j){x, y) = 3t3zR{x, y, t) A S{x, z, t) has quantified star size 1 
because the free variables x and y appear together in one constraint. 

Paths formulas (of arbitrary length), for example (j){x, y, z) = 3ti3t2R{x, ti)AR{ti, z) A 
R{z,t2) A R{t2,y), are of quantified star size 1. 

Star formulas, such as (j){x, y, z, t) = 3uR{u, x) A R{u, y) A R{u, z) A R{u, t) have quan- 
tified star size equal to the degree of the center of the star (here A). 
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Example 15. The hard formula of JPSll^ is of quantified star size n, the size of the 
structure domain. 

We now formulate the main results of this section. The first result is that bounding 
the quantified star size yields tractable counting problems. 

Theorem 16. There is an algorithm that given an acyclic conjunctive query <I> computes 
an arithmetic circuit C that computes Q{^). The runtime of the algorithm is 
where k is the quantified star size of ^ . 

Corollary 17. There is an algorithm for the problem ^ACQ that runs in time \\^\\^^^^ 
where k is the quantified star size of the input query <1> . 

The second result below implies that computing the quantified star size is easy and 
thus classes of #ACQ-instances of bounded quantified star size are efficiently decidable. 

Theorem 18. There is a polynomial time algorithm that, given a hypergraph % = {V, E) 
and 5 C y, computes the S-star size ofH. 

We prove Theorem 1161 and Theorem 1181 in the following two subsections. 
5.2 Computation of 5-star size 

In this section we show that 5'-star size can be computed in polynomial time. 

Let Ti = {V,E) be a hypergraph and S <Z V. We say that E* (Z E covers S if 
•S" C IJgg^;, e. If = y we say that E* is an edge cover of H. An independent set / in 
■H is a set I '^V such that there are no distinct vertices x,y I that lie in a common 
edge e G £'. 

Lemma 19. For acyclic hypergraphs the size of a maximum independent set and a 
minimum edge cover coincide. Moreover, there is an algorithm that given an acyclic 
hypergraph % computes a maximum independent set I and a minimum edge cover E* of 

n. 

The first sentence in Lemma [19] can be seen as an adaptation of Konig's theorem for 
bipartite graphs (see e.g. jBol98j ) to acyclic hypergraphs. The proof uses a minimally 
modified version of an algorithm that Guo and Niedermeier |GN06] describe to compute 
minimum (unweighted) edge covers of acyclic hypergraphs. We show here that their 
techniques cannot only be used to compute minimum edge covers but also maximum 
independent sets of acyclic hypergraphs. 

Proof. Clearly the size of any independent set is not greater than that of any edge cover, 
simply because no edge can cover two vertices in an independent set. So if we present an 
algorithm that computes an independent set / and an edge cover E* of a given acyclic 
hypergraph H = {V,E) such that \I\ = \E*\ we are done. 

So let us now describe an algorithm that computes / and E*: Let T = {Vt,Et) be a 
join tree of Ti with root r. We start with initially empty sets / and E* and iteratively 
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delete leaves of T in a bottom-up manner from the leaves to the root. It is easily seen 
that for each leaf t G Vr, either A(t) C A(t') where t' is the parent of t or there exists 
y € A(t) such that y A(t'). In this case, we will say that y is unique for t. If t = r is a 
leaf of T, i.e. r is the only vertex in T, we say by convention that if A(t) contains any 
vertices, they are all unique for t. 

We do the following until Vr is empty. First, choose a leaf t of T. If there is no vertex 
unique for t, we simply delete t from Vr- If there are vertices that are unique for t, 
choose one vertex y among them and add it to /. Furthermore, add A(t) to E* , delete 
all vertices in A(t) from Ti and delete t from Vr- When Vr is empty, / and E* are the 
result of the algorithm. 

For a vertex t G we denote by % the subtree of T with the root t. Let furthermore 
Vt be defined as the vertices in V that appear only in {A(t*) j t* G V{7t)} C E and in 
no other edge in E. 

Claim 2. Whenever the algorithm deletes t G Vr, the edge set E* covers the vertices Vt- 

Proof. Assume that the claim is false, then there is a first vertex t G Vr met during the 
execution of the algorithm for which after t is deleted some vertex y G Vt is not covered 
by E* . For all children t* of t the vertices in Vt* are covered by E* , so y must lie in A(t). 
But then y is unique for t before t is deleted. Thus A(t) is added to E* and y is covered 
by X{t) G E* after t is deleted which is a contradiction. □ 

From Claim [2] it follows directly that E* is an edge cover at the end of the algorithm. 
Claim 3. At each point in time during the algorithm I is an independent set in G. 

Proof. Assume the claim is wrong. Then, there is a first vertex y that is added to / 
such that y is adjacent to x already in /. The vertex x was added to /, so there was 
t £Vt such that x was unique for t when t was considered by the algorithm. Thus x is 
in Vt and consequently not in A(i') for any vertex t' G Vr \ V{Tt). Hence, if x and y are 
adjacent, there must be a vertex t* G V{Tt) such that {x,y} C X{t*). But y is added 
to / after x and thus it must appear in A(t') for a vertex t' £ Vt \ V{Tt). Then because 
of the connectedness condition and the fact that T is a tree, y must also be in A(t) and 
thus is deleted from T-L when t is deleted. But then y cannot be added later which is a 
contradiction. □ 

With Claim [5] and Claim [3] we have that at the end of the algorithm E* is an edge 
cover of G and / is an independent set in G. It is easy to see, that \E*\ = This 
completes the proof. □ 

Corollary 20. Let % = {V, E) he an acyclic hypergraph and S" C y. Then the following 
statements are true: 

a) The S-star size of H can be computed in polynomial time. 

h) Let %' = (y' ,E') he an S-component ofH and let k be the S-star size ofH'. There 
is a polynomial time algorithm that computes an edge set E* C E' that covers S{^V' 
and \E*\ = k. 
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Proof, [aj) Let Hi, . . . ,7^m be the 5-components of Ti. By Observation [121 each Hi = 
{Vi,Ei), z € {1, . . . ,m}, is acychc and then by Observation [9l Hi[S] is acycUc too. By 
Lemma [T9l for each i G {1, . . . , m}, one can determine the size of a maximum indepen- 
dent set li of Hi[S]. But we claim that for each i the star size Sj of the S-component 
Hi and the size of the maximum independent set in Hi[S] coincide. Indeed, consider 
two vertices x,y € S nVi such that there is an edge e £ E \ Ei such that {x, y} £ e. 
Note that x and y are each included in at least one edge of Ei. Remark also that 
{Vi,Ei U E[Vi n S]) = H[V^ where E[V^ n S] := {e n {Vi n S) \ e £ E,e n {Vi n S) ^ 0}. 
H[Vi\ is acyclic by Observation [9l Let 7^' be a join tree of The vertices {t € 

V{Tl) I A(t) € Ei} are connected in ?^[Vi], so they induce a subtree Ti of 7]'. But then, if 
e E E\Ei, by the connectedness condition the vertex t with A(t) = e must be connected 
to Ti by two different paths that enter % via two different edges (since both x and y are 
in distinct edges of Ei). This contradicts the fact that Tl is a tree. Thus Sj is indeed the 
size of a maximum independent set made of S- vertices in H\yi\ which is the ^-star size 
of Hi- The S-star size of H is then the maximal value among si, and the result 
follows. 

|b]) We compute an edge cover E of size k for H'[S] with Lemma [THl Then for each 
edge e £ E one can easily find an edge e £ E' with e Q e. □ 

5.3 Efficiently computing the Q-polynomial 

We now have the necessary ingredients to prove Theorem 1161 

Proof of Theorem[T^ Let ^ = (<S,0) be an input query of quantified star size k. We 
will construct a quantifier free formula if and a new structure S' in time ||$||'^(^) such 
that 4>{S) = ^{3'). 

Let H be the hypergraph of (p and S the set of free variables of (p. Let H' = {V, E') be 
an S'-component of H and let (j)' be the subformula of (j) whose atomic formulas are the 
hyperedges of E' . The formula (j) can then be written as a conjunction (j)' A iJj, where the 
formula ip contains all the atoms of (f) not in (p' . By the definition of S'-components we 
have (var((/>')\free((;/)')) yar{ilj) = 0. In other words, the quantified variables in (p' only 
appear in atoms of (p' and common variables of (p' and ip are necessarily in S, i.e. they 
are free. 

Let now E* Q E' he a cover of y fi S" of size s < k computed with Corollary [20j 
Let (pi, . . . ,(ps be the atomic formulas associated to edges in E*. We will compute (p'{S) 
and construct in parallel a new atomic constraint ipi and a new relation ipf such that 
(pf = (p'{S). The set of variables var(93i) is IJi=i f''66(</'«) = fi'ee((/)'). For each combination 
tl, . . . ,ts of tuples in (pi{S), . . . , (ps{S) we add the tuple t := ixi . . . ixi to the relation 
(^f if 

• the tuples ti, . . . ,ts are consistent, i.e. they coincide on shared variables, 

• the ACQ instance that we get from (p' by fixing the variables in var((/?i) to the 
values specified by the tuples ti, . . . ,ts is satisfiable. 
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Observe that we have to only consider Y[i=i < ^ 11*^11'^ combinations 

ti, . . . ,ts and the resulting Boolean queries can each be evaluated in time O(|0'| • by 
Yannakakis' algorithm (see |Yan81j ). Thus the construction of cpi and the relation ipf 
can be done in time ||<^|p('^\ Let us call S' the union of the structure S and ipf. We 
then have a new query ipi Aip such that [ifi A ip]{S') = (j){S). 

Let T be a join tree of (j). We can choose a subtree T' of T such that T' is a join 
tree of T-L' by considering the induces subgraph of the vertices {t \ 3e G E',X{t) = e}. 
Also free((?!>') = V D S = var(93i) and recall that (var((/)')\free(0')) n var(V') = 0. Hence, 
contracting T' into a single node whose label is the constraint ipi, results in a join tree 
of the formula (pi A ^. Thus this latter formula is acyclic. 

We iterate this process with the S'-components of the subformula ip. When each S- 
component has been treated, (p is replaced by a quantifier free formula = A . . . A 
'Pm A 4>o where each (pi for i = 1, . . . , m is atomic and m is the number of S-components 
of (p. Furthermore, 4>q is the conjunction of all atomic formulas of (p that contain only 
free variables. Also var((/?) = S. Similarly S is replaced by a structure S' of size bounded 
by ||<l>|p(^) (recall that each component is treated separately) such that (p{S) = (p{S'). 
In each iteration step the formula stays acyclic. Hence, (p is acyclic and we conclude by 
applying Theorem [3j □ 

5.4 #W[l]-hardness of parameterized #ACQ 

In this section we show that several parameterized versions of quantified 7^ ACQ are not 
fixed parameter tractable under standard assumptions from parametrized complexity. 
We consider the following parameterized counting problems: 

• p-star-T^ACQ: counting parameterized by the quantified star size, 

• p-var-#ACQ: counting parameterized by the number of free variables, 

• p-#ACQ: counting parameterized by the size of the conjunctive formula. 

Clearly, for every ACQ instance <I> with formula p we have that the quantified star size 
is at most |var((/9)| < \p\. Thus we get from p-var-^ACQ and p-^pACQ might be easier 
than p-star-^ACQ. The next lemma states that - unless there is a severe collapses in 
parameterized complexity ~ all three problems are not fixed parameter tractable. This 
is in contrast to the decision version which is even in P for all three problems. 

Lemma 21. p-star-#ACQ, p-var-#ACQ and p-#ACQ are all #W[l]-hard. 

Proof. We reduce p-^^DirPath, i.e. counting of paths of length k, to #ACQ on stars. 
With the #W[1] -hardness of p-#DirPath |FG04j the result will follow. The basic obser- 
vation is that there are \E\^ ordered choices of k edges with repetitions. Thus it suffices 
to count the number of choices that are not paths to compute the number of fc-paths in 
a graph. A choice ei, . . . ,efc is not a /c-path, if and only if it has one of the following 
defects: 
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1. It is not a walk, i.e. there is an i such that has as end vertex not the start vertex 
of Cj+i, or 

2. a vertex is visited twice. 

We will encode these properties into a #ACQ-instance of polynomial size whose hyper- 
graph is a fc-star. 

So let G = {V,E) be the input in which we are supposed to count A;-paths. We 
construct a #ACQ-instance $ which has the variables yi, . . . ,yk, z. The have the 
domain V xV, while z has the more complicated domain {0, 1} x [A;] x y x F x [fc] x [k] x V. 
Observe that the domains have polynomial size and all constraints will be binary, so $ 
has polynomial size in n and k. 

For each i E [k] we add a binary constraint Ei in the variables yi,z. Out of these we 
build the formula 

k 

V ■■= /\Ei{yi,z). 

i=l 

The yj-variable will choose arbitrary potential start and end points of an edge. The role 
of the ^-variable is to guess one of the defects described above that prevents the chosen 
vertices from describing a path. We systematically describe tuple set for E^. The first 
component of an assignment can only take or 1 and encodes if z guesses either a defect 
in the walk structure or a double variable. 

• z may guess that the end vertex of is v while the start vertex of ej+i is u for 
u ^ V. It does so by taking the value {0,i,v,u,ji,j2,a) for arbitrary ji,j2,a.. If 
this guess is true, then yi must have chosen an edge that indeed does end in v. So 
we add the tuples {{{b,v),{Q,i,v,u,ji,j2,a)) \ a,b,v,u G V,{b,v) G E,ji,j2 e [k]}- 

• The second defect that z may guess is that the edges do not form a path, because 
the end vertex of ej_i is v while the start vertex of is u. This results in the 
tuples {{{u,b),{0,i - l,v,u,ji,j2,a)) \a,b,v,ue V,{b,v) G E,ji,j2 e [k]}- 

• If 2; predicts a defect preventing a walk in some other place, Ei does not have to 
check this, so we accept everything. The resulting tuples are {{{u,v), {0,j, c, d, ji,j2, 
a,c,d,v,u G V, {u,v) G E,ji,j2 G [k],j ^ {i — 1,^}}- These cover all cases of the 
edges not being a path. 

• If z guesses that the vertex v occurs at two different places in the potential path, 
it does so by specifying edges ej-^ , ej^ with ji < j2 such that the start vertex of Cj^ 
and the end vertex of e^^ is v. It does so by taking a value {l,£,a,b,ji,j2,v) for 
i,a,b arbitrary. If f ^ {ji,j2} then Ei does not have to check for an effect and 
accepts if yi encodes an edge. Thus we add the tuples {{{u,w), {l,i,a,b,ji,j2,v) \ 
a,b,u,v,w G V, {u,w) G E,i,ji,j2 € [k],i ^ {ji,i2}}- 

• 11 z guesses a double occurence of v and i = ji we accept only if that guess is 
correct and j2 > i. So we add the tuples {({v,w),{l,£,a,b,i,j2,v) \ a,b,v,w G 
V, {v,w) G E,£,j2 G [k]}. If i = j2 we add analogous tuples. 
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It is easy to see that <^ accepts assignments to the yi if and only if each gets the 
end points of an edge in G and there is a defect that prevents the edges from being a 
path. Thus the number of satisfying assignments of $ is the number of ordered choices 
of edges in G with repetition that are not paths. This completes the proof. □ 

5.5 Bounded quantified star size is necessary 

In this section we show that quantified star size is in a sense the only restriction that 
makes #ACQ tractable. Not only does bounded quantified star size give tractable 
instances, but the other way round under a standard assumption from parameterized 
complexity all classes of tractable #ACQ-instances must have bounded quantified star 
size. This is somewhat similar to the results of Grohe et al [GSSOlj who proved that 
under reasonable assumptions the only polynomial time decidable subclass of bounded 
arity CQ is the class of bounded treewidth. 

As we have seen in the previous sections, not only the hypergraph of the input for- 
mula is decisive for tractability but also the structure of the quantified variables in this 
hypergraph. We formalize this in the following definition. 

Definition 22. An S -hypergraphs is a pair {71,3) where % = iV-iE) is a hypergraph 
and S 'Z V . We say that #ACQ is tractable for a class Q of S -hypergraphs if for all 
#ACQ instances $ with the associated hypergraph Ti of ^ and the set S of free variables 
of <I> with {71,8) ^ Q we can solve 7^ ACQ in polynomial time. 

Example 23. Let Q be the class of acyclic S -hypergraphs of S-star size bounded by k. 
Then the result of Theorem [721 can be expressed as "#ACQ is tractable for Q ". 

We will use the fact that #ACQ is already hard for very restricted /S-hypergraphs, 
namely for stars in which only the center is not in S. We call this class Qs- Observe 
that the proof of Lemma [21] gives the following Lemma. 

Lemma 24. #ACQ is #W[1] -/lard for Qs parameterized by the size of the stars. 

We now show the main result of this section. 

Theorem 25. Assume FPT ^ #W[1], and let Q be a recursively enumerable class of 
acyclic S -hypergraphs. Then ^ACQ is polynomial time solvable for Q if and only if Q 
is of bounded S-star size. 

Proof. One direction of the claim is Theorem [18l For the other direction assume that 
there is a class G of unbounded S-star size such that #ACQ is tractable on Q. We show 
that in this case 7^ ACQ on Gs parameterized by the star size is in FPT and with Lemma 
[Ml we get FPT = #W[1]. 

So all we have to do is to construct a fixed parameter algorithm for #ACQ on Gs- 
Let ^> be an instance of this problem, i.e. has the formula (p := f\^^^ Ei{yi,z). Let 
the domain of $ be D. Because G is recursively enumerable and of unbounded S-star 
size, there is a computable function (7 : N — > N such that for A: G N one can compute 
{Ti, S) £ G such that Ti is of S-star size at least k in time g{k). We will embed <I> into T-L to 
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construct an #ACQ-instance \I' of size g{k)n^^^^ where n is the size of <I>. Furthermore, 
^ will have the S'-hypergraph "H and the same domain as 

Let 7i' = {V, E') be an S-k-star in H that is formed hy Y = {yi, . . . , y^}, y C 5. For 
each edge e & E we define a constraint Eg. Let first e & E' he an edge that contains yi 
for some i € [k], then Eg has as variables the vertices of e. Let i/j be the first variable of 
Ee followed by the other variables in enS" and after those the variables in e\5. Then Eg 
has the tuples {{a,d, . . . ,d,b, . . . ,b) \ {a,b) £ TZi}, where TZi is the relation of Ei and d is 
an arbitrary but fixed value in D. Observe that this forces all variables in (eD S)\ {yi} 
to the variable d in satisfying assignments, while the variables in e\5' all have a common 
value b. Furthermore, observe that no two of the yi share an edge in E' , so E^ is always 
well defined. 

Let e (z E' with eOY = 0. Again we define a constraint E^. Let in E^ the first variables 
be those in S" n e followed by those in e \ S*, then Ee has the tuples {{d, . . . , d, a, . . . , a) \ 
a G D} for the same d (z D as before. Again in the satisfying assignments all variables 
in e n S are forced to d, while the variables in e \ can take an arbitrary but equal value. 

For e G E \ E' with e CiV = 0, we add a constraint Eg in with the single tuple 
{d, . . . ,d). e €z E \E' with e n y 7^ 0, we have e n y C 5. Furthermore at most one 
vertex in e fl V' , say yi, can be in Y, because Y forms a star. If there is no such yi, we 
construct a constraint Ee with the only tuple {d, . . . ,d). If there is yi £ e, we construct 
a constraint E^ in which yi is the first variable with the tuples {(a, d, . . . ,d) | a E D}. 

These are all constraints of ^. Let the formula 

V'' := A 

and i{j the formula that we get from ip' by quantifying all variables in y \ S". Let ip 
he the formula of ^, then it is easy to see that ^' has has the associated 5-hypergraph 
{T-L,S). Furthermore, ^ has the same number of satisfying assignments as <I>. This is 
because in each satisfying assignment a of ^' all variables in {V \ V) U {S \ Y) are set 
to d. Furthermore, all variables in 1/' \ S take one common value b in a. Let a' be an 
assignment to <^ that we get by setting a'{z) := b and a'(yj) := a(yj). It is easy to see 
that by this construction the satisfying assignments of ^ and $ correspond directly, so 
the number of satisfying assignments is the same. 

Now assume that #ACQ is polynomial time solvable for Q. It follows that the satis- 
fying assignments of ^ can be counted in time {g{k)n)^^^^ and thus #ACQ on Qs is in 
FPT. With Lemma [2H this contradicts the assumption which completes the proof. □ 

6 Applications to arithmetic circuit complexity 

We now show an adaptation of some of the results in this paper to arithmetic circuit 
complexity, i.e. the so-called Valiant model ( |Val79j ) . A polynomial family is a sequence 
(/„) of multivariate polynomials over a field F. The class of polynomial families of poly- 
nomial degree computed by families of polynomial size arithmetic circuits is denoted by 
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VP. This class is a natural candidate to formalize efficient computation with arithmetic 
circuits. A family (/„) of polynomials is in VNP, if there is a family (gn) € VP and 
a polynomial p such that fn{X) = J2ee{o for all n where X denotes the 

vector {Xi, . . . , for some polynomial q. By definition VP C VNP but the precise 

relations between VP and VNP are still unknown. It is however widely conjectured 
that VP 7^ VNP, i.e. that not all polynomial familes in VP are efficiently computable 
by arithmetic circuits. 

A polynomial / is called a projection of g (symbol: / < 5), if there are values Oj G 
F U {Xi,X2, ■ . .} such that f{X) = g{ai, . . . , Og). A family (/„) of polynomials is a p- 
projection of (gn) (symbol: (/„) <p (gn)), if there is a polynomial r such that /„ < gr[n) 
for all n. As usual we say that {gn) is hard for an arithmetic circuit class C if for every 
(fn) € C we have (/„) <p (gn)- If further (gn) € C we say that (gn) is C-complete. 

Classes of polynomials are often better understood through the natural polynomial 
families they contain or, better, which are complete for them. For example, it is well- 
known that determinant family, denoted (Det^), of matrices (-^i,j)i,je[n] defined by: 

n 

BetniXij) = ^ sgn(cr) JJXi_^(j). 

cr&Sn « = 1 

where sgn(o") E {—1, 1} is the sign of the permutation a, is contained in VP (although 
presumably not complete). Similarly, the polynomial family (Perm„) representing the 
permanent of the matrices {Xij)^^^^^] defined as 

n 

Perm„(Xij) = ^ 

has been shown to be VNP-complete (see |Val81j ). Recently, the complexity of poly- 
nomial families defined by natural extensions of counting solutions to constraint sat- 
isfaction problems (like the the Q-polynomial of this paper) has been investigated (see 
[BKMllllMenllj and also [Brillj ) and several non-circuit characterizations of VP, VNP 
and other classes have been obtained. In this section, we generalize some of these re- 
sults by showing that acyclic conjunctive queries can define polynomial families that 
characterizes VP and VNP. 

6.1 Tractable polynomials 

The weighted counting problem #fACQ can be seen as the problem of computing the 
polynomial 

Qi^)--= E n 

in the variables {X^ \ d € D} for a $ in CQ. This naturally makes #fACQ or equiva- 
lently computing Q(<&) a question in the Valiant model. We have: 

Theorem 26. // ($ n) is a family of ACQ of polynomially bounded size and bounded 
quantified star size, then (Q(<I>„)) G VP. Moreover, any family in VP is a p-projection 
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of {Q{^n)), where the $ n are polynomial size quantifier free conjunctive queries whose 
hypergraph is a tree. 

Proof. The upper bound follows from the proof of Theorem (TU] and Theorem [31 The 
lower bound is already true for acyclic queries on graphs and follows from [Menllj . □ 

This result shows that while ^^fACQ is a tractable counting problem, it is probably 
harder than computing the determinant, which is quite rare in counting complexity (see 
|MP08j for the role of the determinant in the Valiant model). 

As a corollary, we also obtain the following result from [Menllj . 

Corollary 27. Letk,d be integers. For any family (<I?n) o/CSP of polynomially bounded 
size and arity bounded by d and tree-width bounded by k, then (Q(<^„)) G VP. 

Proof. Comes from the fact that any CSP of tree- width k built on relation of arity d can 
be transformed (by taking joins of atoms in each bag of the tree decompositions) in time 
0(n-^(^''^)), for some function /, into an acyclic CSP with the same set of solutions. □ 

6.2 The power of existentially quantified variables 

In this section we show a version of Pichler and Skritek's hardness result for ^ACQ for 
the Valiant model, i.e. the polynomial Q($) appears to be harder to compute than in 
the unquantified case. We state the upper bound in a more general way: 

Let D he a finite set with \D\ = d. For a positive integer n we encode an elements 
of a = (ai,...,a„) € -D" by a d x n-matrix M = (w.jj)jg£) jg[„] such that ruij = 1 
if Oj = i and niij = otherwise. Observe that a 0-1-matrix encodes an element in 
D'^ if and only if in each column there is exactly one 1. We define the monomial 
q{M) := q{a) = Hjeln] -^aj ^ encodes a and q{M) := otherwise. With this notation 
we can prove a version of Valiant's classic criterion |Val79j for the Q-polynomial. 

Proposition 28. Let a : {0, 1}* — )• Z a function in gapP. Furthermore let (Dn) be a 
polynomially bounded family of sets and let p{n) be a polynomial. Then the family (fn) 
defined by 

fn= a{M)q{M) 

MG{0,l}l-f'"lxj'(") 

is in VNP. 

Proof. It is folklore that a can computed as a{x) = J2ee{o ij^'f") 9\x\{xi^) where r is a 
polynomial and gn is a family of uniform arithmetic formulas of polynomial size (see 
e.g. |BF91| ). In the proof of Lemma 4 in the full version of [ Menllj it is shown how to 
compute the function q with small arithmetic formulas. Combining this we directly get 
the proposition. □ 

Proposition 29. // (<I> n) is a family of acyclic conjunctive queries of polynomial size, 
then (Q($„)) € VNP. Moreover, any family in VNP is a p-projection of such a 
(Q($„)). The {^n) family can be supposed of arity bounded by two. 
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Proof. We start off witli tlie containment in VNP, whicli follows easily from the fact 
that given an assignment a and an acyclic conjunctive query $ one can in polynomial 
time decide if the a satisfies Applying Proposition [28] we get the upper bound. 

The hardness is obtained by reduction from the family (Perm„). Perm„ can be seen 
as the sum of weights of perfect matchings in the weighted bipartite graph K^ ji. 

Let Kn^n = U -B,i?) be the complete bipartite graph with A = {ai, ...,a„}, B = 
{61, hn}- We denote by e^j- the edge between vertices ai G A and bj £ B. We construct 
a structure S = {U, F, G, H, I, J, K,ai, an) of domain U as follows. 

• The universe U = AU B U {eij \ i,j £ [n]} U {p, n, I}, 

• G = {p,n}, 

• F = {{n,l)}U{{p,ei,j) \i,j e [n]}, 

• H = {{eij,ai) I i,j e [n]} U {(/,ai) [ i G [n]}, 

• I = {ip,l)}U{{n,bi) I i E [n]}, 

• K = {{bj,ei^k) ■■ i,j,k G [n],j / k}U{{l,l)} 

Note that the maximal arity of a predicate is two. Let (p{xi, x„, x, x'^, x^) be the 
following acyclic conjunctive query: 

n n n n 

G{x) A l\ F{x, Xi) A /\ H{xi, Oi) A 3yl{x, y) A f\ K{y, x',) A /\ H{x'i, ai) (1) 

1=1 i=l i=l i=l 

There are two types of satisfying assignments: 

• If 3; takes the value p, then y and all x[ must take the value I. The Xi take as values 
the edges eij in such a way that for each i G [n] there is an edges eij. Thus the 
vertices in A are mapped to the vertices of B in the original graph in an arbitrary 
way. 

• If X takes the value n, then all Xi must take the value I. Furthermore y takes a 
value b £ B. The x'^ then take as values the edges eij in such a way that each Oj 
is mapped to a vertex bj £ B \ {b} by this edge. Thus the assignment to the x[ is 
an arbitrary non-injective assignment of the vertices in A to those in B. 

Thus the query <I> = (S, (j)) defines a polynomial Q{^) with the following property 
(with Cij corresponding to variable Xij, p to Xp, n to X„ and / to X/). 

Qi^) =E n ^'^w 

tGvar($) 

n n 
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Projecting correctly, we get 

PevuiniXij) = Q($)(Xi,...,X„,Xp,X„,X«)|xp=i,x„=-i,Xi=i. 

□ 

6.3 Unions and intersections of ACQ 

We now show that a version of Proposition O is also true for the Valiant model. 

Proposition 30. //(<!>„,) is a family of queries of polynomial size that are conjunction 
(resp. disjunction) of two acyclic conjunctive queries, then {Q{^n)) G VNP. Moreover, 
any family in VNP is a p-projection of such a The result remains true for the 

polynomial family (P(<I>„)). 

Proof (Sketch). The upper bound follows directly from Proposition [55J The proof of the 
lower bound for conjunction of acyclic queries follows directly as Proposition [5j The case 
of disjunction is obtained by reduction from the case of conjunction. Let ^ = (5, (/>(x)) 
and ^ = {S,ip{x)) be two acyclic conjunctive queries. W.l.o.g. we can suppose they 
both are on the same structure S of signature a and domain D. We denote by <^ A ^' 
the instance (iS,(/)(x) A tp{x)) and by <I> V ^' the instance (5,(/>(x) V V'(x)). Let S' be a 
new structure of domain D' = DU {ai, 02, 03} where ai,a2, as are not in D. Structure 
S' includes S and is equipped with two new unary relations R and S which are defined 
as follows: 

R = {ai,a2}, S = {02, as}. 
Let us now consider the following disjunction of two acyclic formulas: 

V9(x,y) ^ (</.(x) A R{y)) V (V(x) A S{y)) . 
The query problem T = {S' ,(p{x,y)) has the following tuples as solutions: 

• (a, ai) for a G 0(5). 

• (a, as) for a G il^{S). 

• (a,a2) for a G 0(5) U V(5). 

Then, associating each value a^ with variable Yf. 

Q(T) =Y,Q{^) + Y:iQ{^) + Y2Q{^y^). 
By projection, we get 

Q{<^ A ^) = Q(T)(X, Yi,Y2, 13)ln=i,y2=-i,y3=i- 

This shows that polynomials obtained by disjunction of two acyclic queries can repre- 
sented as projections of polynomials obtained by conjunction and hence this is true for 
all polynomial families in VNP. □ 
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7 Conclusion 



We have presented a complete picture of tractability for weighted #ACQ. However, 
there are still many open questions that could be explored in the future. 

The first question is how to generalize our results from acyclic to more general classes 
of conjunctive queries. While it should be possible to generalize the unquantified case 
to, say, bounded hypertree width (and indeed Pichler and Skritek [PSllj sketch this for 
unweighted counting), the quantified case is less clear. The generalizations of quantified 
star size for e.g. bounded treewidth queries and thus the counting algorithm of Theorem 
[TBI appears straightforward. The real problem would then be if there is an efficient 
algorithm to decide for a combination of quantified star size and treewidth similarly to 
Theorem 1181 For a fixed tree decomposition computing the star size would be easy, but 
ruling out other tree decompositions with smaller quantified star size is at least non 
obvious to us. 

One future direction of work could be trying to apply our results to combinatorial 
counting problems. Conjunctive queries are a very versatile in encoding other problems, 
so can our results be used to find non-obvious algorithms for such problems? 

We now turn to aspects from parameterized complexity. The first question is com- 
pletely determining the complexity of the parameterized problems in Section 15.41 For 
the p-#ACQ it is easy to see that (by complementing the formula) the problem can be 
reduced to counting solutions of Hi-formulas. These formulas define the class #A[2] 
and thus we have p-^^ACQ G 7^A[2]. Observe that we do not use the acyclicity in this 
reduction at all which could make the problem easier (and does indeed in other settings). 
So it would not be very surprising if we could show p-^ACQ € #W[1] . For p-var-^^ACQ 
containment in ^W[P] is straightforward, but we do not see how to bring this down 
into the #A- or the ^^W-hierarchy (see |FG06j and also the discussion in [Thu06j for a 
definition of these classes). It would not even be surprising to show that p-var-^ACQ 
could be reduced to p-#ACQ. Intuitively, too many quantified variables for few free 
variables should not make too much sense, which could then lead directly to a bound of 
the size of formulas in the number of free variables. Finally, for p-star-^^ACQ not even 
an 7^W[P] upper bound is apparent. 

Our Theorem [25] strongly depends on the fact that we can in the construction of 
hard instances use as many different relation symbols Eg as we need. Can we show a 
stronger version in parallel to the result of Grohe et al. [GSSOl] that holds for any fixed 
vocabulary that contains at least one binary relation symbol? 

Another question is if there are any structural subclasses of ^ACQ that allow more 
efficient counting than the algorithm of Theorem [T6j Theorem [25] tells us that these 
classes would be of bounded quantified star size, but could we add more structural 
restrictions to yield a class that allows, say, fixed parameter counting? Or can we under 
plausible assumptions rule out such classes in the style of |Mar07j ? 
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